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USE OF GENETIC POLYMORPHISMS TO PREDICT 
DRUG-INDUCED HEPATOTOXICITY 

FIELD OF THE INVENTION 

[0001] This invention relates generally to the analytical testing of tissue samples in 
vitro, and n:iore particularly to the analysis of genetic polymorphisms as biomarkers for 
predicting the occurrence of drug-induced hepatotoxicity. 

DESCRIPTION OF THE RELATED ART 

[0002] Among the disorders that arise from dysfunction of the microvasculature of 
diabetic patients is retinopathy, which manifests itself clinically as vision impairment and 
can result in blindness. Diabetic retinopathy is characterized by microaneuiysms, excessive 
vascular permeability, areas of retinal nonperfiision, and retinal neovascularization. Much 
evid^ce suggests a causal link between higih blood glucose levels and the development of 
the underlying lesions responsible for deficits in organ function. For a review, see Way KJ 
et al, Diabetic Medicine 18: 945-59 (2001). 

[0003] Among the effects of hyperglycemia is the over-activation of the diacylglycerol 
(DAG) - protein kinase C (PKC) signal transduction pathway. Both cell culture 
experiments and animal models of diabetes demonstrate excessive levels and activity of 
DAG and PKC in vascular endothelial cells. Koya D & King GL, Diabetes 47: 859-866 
(1998); Ishii H et aL, JMolMed 76: 21-31 (1998) and Way KJ et aL, Trends Pharmacol 
Sci 21: 181-7 (2000). Activation of many of the isoforms of PKC serine-threonine kinases 
is dependent on DAG, a cleavage product of membrane phospholipids. Among the 
activated isoforms of PKC serine-tibreonine kinases is the predominant isoform PKC(i, 
which is associated with diabetic retinopathy. DAG is usually generated by 
agonist-stimulated hydrolysis of membrane phospholipids, but also can be synthesized '^Ke 
novo by direct metabolism of glucose. Dxmlop ME & Larkins RG, Biochem Biophys Res 
Commun 132: 467-73 (1985); IshiiH a/., JMolMed76: 21-31 (1998). In response to 
hyperglycemia, de novo synthesis of DAG increases substantially, resulting in the 
activation of PKC/3. Ishii H et al, JMol Med 76: 21-31 (1998). 
[0004] As a consequence of continual activation of the DAG-PKC pathway, many 
aspects of vascular function are affected. Cytokine activation and leukocyte adhesion are 
stimulated; blood flow and microvessel contractility are altered; and extracellular matrix 
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synliiesis increases, resulting in the thickening of basement membranes. The retinal 
microenviroimient becomes ischemic as a result of the aforementioned changes. Expression 
of vascular endothelial growtti factor (VEGF), a potent stimulator of neovascularization, is 
upregulated in response to ischemia and by other PKCp-dependent mechanisms. Aiello DP 
et aL, Diabetes 46: 1473-80 (1997). 

[0005] N-benzoyl-staurosporine (PKC412) is an inhibitor of both PKC and an essential 
VEGF receptor, KDR (Kinase insert Domain-containing Receptor, also known as 
VEGF-R2). N-benzoyl-staurosporine is being developed for several indications, including 
the treatment of diabetic macular edema. See, U.S. Pat No. 6,214,819. See also, U.S. Pat. 
Apphi. 20030119812, 20030125343 and 20030153551. Although a promising medication, 
treatment with N-benzoyl-staurosporine can result in known side effects, including hver 
toxicity. Thus, there is a need in the art for reducing the side effects of side effects of 
N-benzoyl-stauFosporine. 

SUMMARY OF THE INVENTION 

[0006] The invention provides me^ods for determining subjects who are at risk for 
developing drug-induced hepatotoxicity. In one embodiment, the invention provides for the 
use of genomic analysis to identify patients at risk for e^eriencing hepatotoxicity during 
staurosporine therapy. In a particular embodiment, the staurosporine therapy involves the 
administration of N-benzoyl-staurosporine for treating diabetic macular edema. The 
hepatotoxicity prediction involves the determination of serum aspartate transaminase (AST) 
levels. In another embodiment, the inv^tion provides methods for determining optimal 
treatment strategies for these patients. 

[0007] The invention also provides clinical assays, kits and reagents for predicting 
hepatotoxicity prior to taking a drug. In one CTibodiment, the kits contain reagents for 
determining genetic polymorphisms in the JLIA gene. In a particular embodiment, the ^ 
genetic polymorphism is at the PG locus ID 279 of theiLi^ gene. In assays of genetic 
polymorphism of PG locus ID 279, the CC genotype (SEQ ID NO:l) is a biomarker for 
predictions of higher risk of hepatotoxicity, while the CT genotype (SEQ ID NOS:l and 2) 
and TT genotype (SEQ ID NO:2) are biomarkers for a lower risk of hepatotoxicity. In 
another embodiment, the kits contain reagents for determining genetic polymorphisms in 
the ILIA gene. In a particular embodiment, the genetic polymorphism is at the PG locus ID 

302 of the ILIA gene. In assays of genetic polymorphism of PG locus ID 302, the GG 
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genotype (SEQ ID NO:3) is a biomarker for predictions of higher risk of hepatotoxicity, 
while the GT genotype (SEQ ID NOS:3 and 4)and TT genotype (SEQ ID NO:4) are 
biomarkers for a lower risk of hepatotoxicity. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0008] FIG. 1 shows the AST/ALT (aspartate aminotransferase / alanine 
aminotransferase) maximum levels vs. ILIA (interleukin 1 -alpha; PG locus ID 279). The 
scatter plots show (A) maximiim aspartate aminotransferase levels, (B) ratio of ASTMAX 
and the upper limit of normal (ULN), (C) maximum alanine aminotransferase levels and 
(D) ratio of ALT MAX and ULN for subjects in the clinical trial with genotypes of CC or T 
(CT or TT) for ILIA PG locus ID 279. The upper limit of normal for aspartate 
aminotransferase is 42 U/L for ages 3-64 and 55 U/L for 65 and over and for alanine 
an:iinotransferase is 48 U/L for all ages. ULNs are indicated by a line. 
[0009] FIG. 2 shows the AST/ALT (aq)artate aminotransferase / alanine 
aminotransferase) maximum levels vs. ILIA (interleukin l-alph€^ PG locus ID 302). The 
scatter plots show plots of (A) maximum alanine aminotransferase levels, (B) ratio of 
ASTMAX and ULN, (C) maximum ALT levels and (D) ratio of ALT MAX and ULN for 
subjects in the clinical trial with genotypes of G or T (GT or TT) for ILIA PG locus ID 
302. The upper limit of normal for aspartate aminotransferase is 42 U/L for ages 3-64 and 
55 U/L for 65 and over and for alanine aminotransferase is 48 U/L for all ages. ULNs are 
indicated by a line. 

DETAILED DESCRIPTION OF THE INVENTION 

[001 0] The invention advantageously provides a way to determine whether a patient 
will experience hepatotoxicity during drug treatment, prior to actually taking the drugs. The 
invention thus provides safer treatment regunens for patients by helping clinicians to either 
(1) alter the dose of the drug, (2) provide additional or alternative conconoitant medication 
or (3) choose not to prescribe that dmg for that patient. 

[001 1] Relevant genetic polymorphisms were identified in a multicenter, randomized, 
double-masked placebo-controlled dose finding phase II trial of N-benzoyl-stauroq)orine, 
which was conducted in subjects with diabetic macular edema. The safety of 
N-benzoyl-staurosporine was evaluated in the subjects and additional pharmacokinetic 
information on was collected. While N-benzoyl-staurosporine showed a good safety profile, 
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nine of the 140 subjects that were enrolled in the clinical trial experienced hepatotoxicity, 
as defined by fold increases in liver transaminases over the upper limit of normal (ULN)- 
Subjects were flagged as having experienced hepatotoxicity if either aspartate 
aminotransferase (AST) or alanine aminotransferase (ALT) had fold elevations over the 
upper limit of normal on visit 3, 4 or 5. 

[0012] Of the eighteen single nucleotide polymorphisms (SNP) from seven genes that 
were genotyped, two in the interleukin 1-alpha {ILIA) gene were associated with the 
maximum serum aspartate transaminase level recorded oh visits 3, 4 or 5. ILIA encodes an 
inflammatory cytokine that plays a pivotal role in mediating acute phase responses. One 
ILIA polymorphism is located in the promoter region of ILIA and the other results in a 
serine to alanine substitution at amino acid position 1 14. These results suggest that 
polymorphisms in ILIA or a gene located near it on 2ql4, may be directly involved with the 
onset of liver toxicity following, administration of N-benzoyl-staurosporine. 
[0013] As used herein, a polymorphism in the ILIA genetic locus is **predictive" of a 
"high" risk of hepatotoxicity when genetic polymorphism correlates significantly wifli the 
development of drug-induced hepatotoxicity or with elevated levels of serum a^artate 
transaminase. See, for example, below, where the CC genotype at the PG locus ID 279 and 
the GG genotype at the PG locus ID 302 are predictive of a high risk of hepatotoxicity. As 
used herein, a polymorphism in the ILIA genetic locus is **predictive" of a "low" risk of 
hepatotoxicity when genetic polymorphism correlate significantly with liie lack of 
development of hepatotoxicity. See, for example, below, where the CT or TT genotype at 
the PG locus ID 279 and the GT or TT genotype at the PG locus ID 302 are predictive of a 
low risk of hepatotoxicity. Determinations of significance (p values) can be determined by 
analysis of variance (ANOVA) or Fisher's Exact tests. Determinations of one SNP 
polymorphism at a certain ILIA genetic site as having a high risk for developing 
hepatotoxicity and another SNP polymorphism at that ILIA genetic site as having a low 
risk for developing hepatotoxicity can be combined for greater accuracy of determination. 
For PG locus IDs 279 and 302, associations between ILIA polymorphisms and serum 
aspartate transaminase levels had p values of 0.0089 and 0.0097. 

[0014] These results can reasonably be extrapolated to the prediction of hepatotoxicity 
in patiCTts following the administration of any staurosporine derivatives, based upon the 
structural similarity and modes of action in the liver of staurosporine derivatives. Among 
the staurosporine derivatives are those described in U.S. Pat. No. 5,093,330. Preferred 
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compounds are N-acylstaurospoiines and their phannaceutically acceptable salts, including 
N-(2-aminoacetyl)stauroq)orine;N-(3,5-diintrobenzoyl)-staurospor^ 
N-(3-carboxypropionyl)staurosporine;N-(3-fluorobenzoyl)-staurosporine; 
N'<3^nitrobenzoyl)staiux)sporine;N-(4-carboxybenzoyl)staurosporine; 
N"[(tert-butoxycaibonylainino)-acetyl]-staurosporine; N-alanylstaurosporine; N-benzoyl 
staurosporine; N-carboxymethyl-staurosporine; N-ethyl-staurosporine; 
N-methylaminothiocarbonylstaurosporine; N-phenylcarbamoylstaiux)sporine; 
N-tert-butoxycarbonylstaurosporine; and N-trifluoracetylstaurosporine. 
[001 5] Moreover, the results can be extrapolated to the prediction of hepatotoxicity in 
patients who are being treated for diseases other than diabetic macular edema. The method 
of the invention is applicable to vertebrate subjects, particularly to mammalian subjects, 
more particularly to human subjects. The invention is particularly applicable to diabetic 
subjects. 

[0016] The diagnosis of hepatotoxicity can be accomplished using assays of serum 
enzyme levels. Serum enzyme assays indicative of liver dysfunction are well*-knovm to 
those of skUl in the medical arts and routine in ho^ital laboratories. For a definition of 
hepatotoxicity based upon serum levels of aspartate transaminase (AST) and used in the 
EXAMPLE: The definition of hepatotoxicity used in this analysis was based on fold 
increases in serum aspartate transaminase or alanine aminotransferase over the upper limit 
of normal (ULN) on visits 3, 4 or 5. The upper limit of normal for serum aspartate 
transaminase is 42 U/L for ages 3-64 and 55 U/L for ages 65 and up; for serum alanine 
aminotransferase the upper limit of normal is 48 U/L (Smithkline Beecham Clinical 
Laboratories Reference Alert Ranges). While an elevation of either enzyme at visit 3, 4 or 5 
constituted hepatotoxicity, transaminase elevations were disregarded during subsequent 
visits when the dmg was not being administered. Furthermore, subjects who had elevated 
liver function tests at baseline (visit 2) were not flagged as having experienced 
h^atotoxicity, regardless of the elevation in their enzyme levels following administration 
of the drag. Nine subjects were flagged as haviag experienced hepatotoxicity. Of these, six 
consented to clinical pharmacogenetic analysis. 

[0017] Individuals carrying polymorphic alleles may be detected at the DNA, the 
RNA, or the protein level using a variety of techniques that are well known in the art. 
Strategies for identification and detection are described in e.g, EP 730,663, EP 717,1 13, and 
PCT US97/02102. The methods of the invention may mvolve the detection of 
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pre-characterized polymorphisms. That is, the genotyping location and nature of 
polymorphic forms present at a site have already been determined (see, discussion above 
regarding interrogated genes). The availability of this information allows sets of probes to 
be designed for specific identification of the known polymorphic forms. The identification 
of alleles containing single nucleotide polymorphisms may involve the amplification of 
DNA firom target samples. This can be accomplished by e.g., PGR. See generally PCR 
Technology: Principles and Applications for DNA Amplification, (ed. Erlich, Freeman 
Press, New York, New York, 1992); PCR Protocols: A Guide to Methods and Applications 
(eds. Innis, et aL, Academic Press, San Diego, Calif., 1990). The detection of 
polymorphisms in specific DNA sequences, can be accomplished by a variety of methods 
including, but not limited to, restriction-firagment-length-polymoiphism detection based on 
allele-specific restriction-endonuclease cleavage (Kan & Dozy, Lancet 11:910-912 (1978)), 
hybridization with allele-specific oligonucleotide probes (Wallace et at, NucL Acids Res. 
6:3543-3557 (1978)), including immobilized oligonucleotides (Saiki ot al, Proc. Natl 
Acad. Set. USA, 86:6230-6234 (1969)) or oligonucleotide arrays (Maskos & Southern, 
Nucl Acids Res. 21:2269-2270 (1993)), allele-specific PCR (Newton et al, Nucl Acids 
Res. 17:2503-2516 (1989)), mismatch-repair detection (MRD) (Faham & Cox, Genome 
Res, 5:474-482 (1995)), binding of MutS protein (Wagner et al, Nucl Acids Res. 
23:3944-3948 (1995), denaturing-gradient gel electrophoresis (DGGE) (Fisher & Lerman, 
Proc. Natl Acad, Set U.SA. 80:1579-1583 (1983)), single-strand-conformation- 
polymorphism detection (Orita et al. Genomics 5:874-879 (1983)), RNAse cleavage at 
mismatched base-pairs (Myers et al. Science 230:1242 (1985)), chemical (Cotton et al, 
Proc. Natl Acad. Set U.SA., 8Z:4397-4401 (1988)) or enzymatic (Youil etal, Proc. Natl 
Acad. Sci. U.S.A. 92:87-91 (1995)) cleavage of heteroduplex DNA, methods based on allele 
specific primer extension (Syvanen et al. Genomics 8:684-692 (1990)), genetic bit analysis 
(GBA) (NikLforov et al, Nucl Acids Res. 22:4167-4175 (1994)), the 
oligonucleotide-ligation assay (OLA) (Landegren et al. Science 241:1011 (1988)), the 
allele-specific ligation chain reaction (LCR) (Barrany, Proc. Natl Acad. Set U.S.A. 
88:189-193 (1991)), gap-LCR (Abravaya et al, Nucl Acids Res. 23:675-682 (1995)), 
radioactive and/or fluorescent DNA sequencing using standard procedures well known in 
the art, and peptide nucleic acid (PNA) assays (Oram et al, Nucl Acids Res. 21:5332-5356 
(1993); Thiede e/ al, Nucl Acids Res. 24:983-984 (1996)). Additional guidance is provided 
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by Sambrook J et aL, Molecular Cloning: A Laboratory Manual, Third Edition (Cold 
Spring Harbor Press, Cold Spring Harbor, New York, 2000). 

[0018] Guidance for the use of N-benzoyl-staurosporine and related staurospaurine 
derivatives is provided in U.S. Pat. Nos. 5,744,460; 5,827,846; 6,018,042; 6,153,599 and 
6,214,819, each of which is incorporated by reference. Additional guidance for flie use of 
N-benzoyl-staurosporine related staurospaurine derivatives for treating ocular neo vascular 
diseases and in decreasing capillary permeability in the retina is provided in U.S. Pat. 
Appbis. 20030119812, 20030125343 and 20030153551, each of which is incorporated by 
reference. 

[0019] Single Nucleotide Polymo?j}hisms, Sequence variation in the human genome 
consists primarily of single nucleotide polymorphisms ("SNPs") with the remainder of the 
sequence variations being short tandem repeats (including micro-satellites), long tandem 
repeats (mini-satellite) and other insertions and deletions. A SNP is a position at which two 
alternative bases occur at sqppreciable frequency (i.e. >1%) in the human population. A SNP 
is said to be "allelic" in that due to the existence of the polymorphism, some members of a 
species may have the immutated sequence (i.e., the original "allele") whereas other 
members may have a mutated sequence (i.e., the variant or mutant allele). la the simplest 
case, only one mutated sequence may exist, and the polymorphism is said to be diallelic. 
The occurrence of altemative mutations can give rise to triallelic polymorphisms, etc. SNPs 
are widespread throughout the genome and SNPs that alter the function of a gene may be 
direct contributors to phenotypic variation. Due to their prevalence and widespread nature, 
SNPs have potential to be important tools for locating genes that are involved in human 
disease conditions, see e.g.^ Wang et aL, Science 280: 1077-1082 (1998), which discloses a 
pilot study in which 2,227 SNPs were mapped over a 2.3 megabase region of DNA. 
[0020] An association between a single nucleotide polymorphisms and a particular 
phmotype does not indicate or require that the SNP is causative of the phenotype. Instead, 
such an association may indicate only that the SNP is located near the site on the genome 
where the determining factors for the phenotype exist and therefore is more likely to be 
found in association with these deteraiining factors and thus with the phenotype of interest. 
Thus, a SNP maybe in linkage disequilibrium (LD) with the 'true* functional variant. LD, 
also known as allelic association exists when alleles at two distinct locations of the genome 
are more highly associated than expected. 



7 



wo 2005/040415 PCT/EP2004/011123 

[0021] Thus a SNP may serve as a marker that has value by virtue of its proximity to a 
mutation that causes a particular phenotype. 

[0022] SNPs that are associated with disease may also have a direct effect on the 
function of the gene in which they are located. A sequence variant may result in an annino 
acid change or may alter exon-intron splicing, thereby directly modifying the relevant 
protein, or it may exist in a regulatory region, altering the cycle of expression or tihe 
stability of the mRNA, see Nowotny P Current Opinions in Neurobiology 1 1:637-641 
(2001). 

[0023] It is increasingly clear that the risk of developing many common disorders and 
the metaboUsm of medications used to treat these conditions are substantially influenced by 
underlying genomic variations, although the effects of any one variant might be small. 
[0024] Therefore, an association between a SNP and a clinical phenotype suggests, (1) 
the SNP is functionally responsible for the phenotype or, (2) there are other mutations near 
the location of the SNP on the genome that cause the phenotype. The 2"^ possibility is 
based on the biology of inheritance. Large pieces of DNA are inherited and maikers in 
close proximity to each other may not have be^ recombined in individuals that are 
iinrelated for many generations, te., the markers are in linkage disequlibrium (LD). 
[0025] Identification and characterization of SNPs. Many dififerent techniques can be 
used to identify and characterize SNPs, including single-strand conformation 
polymorphism analysis, heteroduplex analysis by denaturing high-performance liquid 
chromatography (DHPLC), direct DNA sequencing and computational methods, see Shi 
MM, Clin Chem A1\\6A-\12 (2001). Thanks to the wealth of sequence information in 
public databases, computational tools can be used to identify SNPs in silico by aUgning 
independently submitted sequences for a given gene (either cDNA or genomic sequences). 
Comparison of SNPs obtained experimentally and by in silico methods showed that SS% of 
candidate SNPs found by SNPFinder(http://Ipgws.nci.iiih.gov:82/perl/snp/snp_cgi.pl) have 
also been discovered experimentally, see^ Cox et al. Hum Mutat 17:141-150 (2001). 
However, these in silico methods could only find 27% of trae SNPs. 
[0026] The most common SNP typing methods currently include hybridization, primer 
extension and cleavage methods. Each of these methods must be connected to an 
appropriate detection system. Detection technologies include fluorescent polarization, (see 
Chan X et al. Genome Res 9:492-499 (1999)), luminopGietric detection of pyrophosphate 
release (pyrosequencing), (see Ahmadiian A et aL, AnalBiochem 280:103-10 (2000)), 
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fluorescence resonance energy transfer (FRET)-based cleavage assays, DHPLC, and mass 
spectrometry, (see Shi MM, Clin Chem 47:164-172 (2001) and U.S. Pat No. 6,300,076 
Bl). Other methods of detecting and characterizing SNPs are those disclosed in U.S. 
Patents No. 6,297,018 Bl and 6,300,063 Bl. The disclosures of the above references are 
incorporated herein by reference in their entirety. 

[0027] In a particularly preferred embodiment the detection of the polymorphism can 
be accomplished by means of so called INVADER™ technology (available from Third 
Wave Technologies Inc. Madison, Wisconsin, USA). In this assay, a specific upstream 
"invader" oligonucleotide and a partially overlapping downstream probe together form a 
specific structure when bound to complementary DNA template. This structure is 
recognized and cut at a specific site by the Cleavase enzyme, and this results in the release 
of the 5* flap of the probe oligonucleotide. This fragment ihsn serves as the "invader" 
oligonucleotide with respect to synthetic secondary targets and secondary fluorescently 
labelled signal probes contained in the reaction mixture. This results in specific cleavage of 
the secondary signal probes by the Cleavase enzyme. Fluorescence signal is generated 
when this secondary probe , labelled with dye molecules capable of fluorescence resonance 
energy transfer, is cleaved. Cleavases have stringent requirements relative to the stmcture 
formed by the overlapping DNA sequences or flaps and can, therefore, be used to 
specifically detect single base pair mismatches immediately upstream of the cleavage site 
on the downstream DNA strand. See Ryan D et al. Molecular Diagnosis 4(2): 135-144 
(1999) and Lyamichev V et al Nature Biotechnology 17:292-296 (1999), see also US 
Patents 5,846,717 and 6,001,567 (the disclosures of which are incorporated herein by 
reference in tiieir entirety). 

[0028] In some embodiments, a composition contains two or more differently labelled 
genotyping oligonucleotides for simultaneously probing the identity of nucleotides at two 
or more polymorphic sites. It is also contemplated that primer compositions may contain 
two or more sets of allele-specific primer pairs to allow simultaneous targeting and 
amplification of two or more regions containing a polymorphic site. 
[0029] Genotyping oligonucleotides of the invention may also be immobilized on or 
synthesized on a solid surface such as a microchip, bead, or glass slide (see, 
WO 98/20020 and WO 98/20019). Such immobilized genotyping oligonucleotides maybe 
used in a variety of polymorphism detection assays, including but not limited to probe 
hybridization and polymerase extension assays. Immobilized genotyping oligonucleotides 
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of the invention may comprise an ordered array of oligonucleotides designed to rapidly 
screen a DNA sample for polymorphisms in multiple genes at the same time, 
[0030] An allele-specific oligonucleotide primw of the invention has a 3' terminal 
nucleotide, or preferably a 3' penultimate nucleotide, that is coinplem^ntary to only one 
nucleotide of a particular SNP, thereby acting as a primer for polymefase-mediated 
extension only if the allele containing that nucleotide is present. Allele-specific 
oligonucleotide primers hybridizing to either the coding or noncoding strand are 
contOTiplated by the invention. An ASO primer for detecting gene polymorphisms could be 
developed using techniques known to those of skill in the art. 

[003 1] . Other genotyping oligonucleotides of the invention hybridize to a target region 
located one to several nucleotides downstream of one of the novel polymorphic sites 
identified herein. Such oligonucleotides are useful in polymerase-mediated primer 
extension methods for detecting one of the novel polymorphisms described herein and 
therefore such genotyping oligonucleotides are referred to herein as **pruner-exteiision 
oligonucleotides". In a preferred embodimmt, the 3 -terminus of a primer-extmsion 
oligonucleotide is a deoxynucleotide complementary to the nucleotide located immediately 
adjacent to the polymorphic site. 

[0032] In another embodiment, the invention provides a kit comprising at least two 
genotyping oligonucleotides packaged in separate containers. The kit may also contam 
other components such as hybridization buffer (where the oligonucleotides are to be used as 
a probe) packaged in a separate container. Altematively, where the oligonucleotides are to 
be used to ampUfy a target region, the kit may contain, packaged in separate containers, a 
pol3ntnerase and a reaction buffer optimized for primer extension mediated by the 
polymerase, such as PGR. 

[0033] The above described oligonucleotide compositions and kits are useful in 
methods for genotyping and/or haplotyping the gene in an individual. As used herein, the 
terms " genotype" and " haplotype*' mean the genotype or haplotype containing the 
nucleotide pair or nucleotide, respectively, that is present at one or more of the novel 
polymorphic sites described herein and may optionally also include the nucleotide pair or 
nucleotide present at one or more additional polymorphic sites in the gene. The additional 
polymorphic sites may be currently known polymorphic sites or sites that are subsequently 
discovered. 
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[0034] One embodiment of the genotyping method involves isolating from the 
individual a nucleic acid mixture comprising the two copies of the gene, or a fragment 
thereof, that are present in the individual, and determming the identity of the nucleotide pair 
at one or more of the polymorphic sites in the two copies to assign a genotype to the 
individual. As will be readily understood by the skilled artisan, the two "copies" of a gene 
in an individual may be the same allele or may be different alleles. In a particularly 
preferred embodimra^t, the genotyping method comprises determining the identity of the 
nucleotide pair at each polymorphic site. 

[0035] Typically, the nucleic acid mixture is isolated from a biological sample taken 
from the individual, such as a blood sample or tissue sample. Suitable tissue samples 
include whole blood, semen, saliva, tears, uiine, fecal material, sweat, buccal smears, skin 
and hair* The nucleic acid mixture may be comprised of genomic DNA, mRNA, or cDNA 
and, in the latter two cases, the biological sample must be obtained from an organ in which 
flie gene is expressed. Furthermore it will be understood by the skilled artisan that mRNA 
or cDNA preparations would not be used to detect polymorphisms located in introns or in 
5' and 3' nontranscribed regions. If a gene fragment is isolated, it must contain the 
polymorphic site(s) to be genotyped. 

[0036] One embodiment of the haplotyping method comprises isolating from the 
individual a nucleic acid molecule containing only one of the two copies of the gene, or a 
fragment thereof, that is present in the individual and determining in that copy the identity 
of the nucleotide at one or more of the polymorphic sites in that copy to assign a haplotype 
to the individual. The nucleic acid may be isolated using any method capable of separating 
the two copies of the gene or fragment, including but not limited to, one of the methods 
described above for preparing isogenes, with targeted in ^nvo cloning being the preferred 
approach. As will be readily appreciated by those skilled in the art, any individual clone 
will only provide haplotype information on one of the two gene copies present in an 
individual. If haplotype information is desired for flie individual's other copy, additional 
clones will need to be examined. Typically, at least five clones should be examined to have 
more than a 90% probability of haplotyping both copies of the gene in an individual. In a 
particularly preferred embodiment, the nucleotide at each of polymorphic site is identified. 
[0037] In a preferred embodiment, a haplotype pair is determined for an individual by 
identifying the phased sequence of nucleotides at one or more of the polymorphic sites in 
each copy of the gene that is present in the individual. In a particularly preferred 
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embodiment, the haplotyping method comprises identifying the phased sequence of 
nucleotides at each polymorphic site in each copy of the gene. When haplotyping both 
copies of the gene, the identifying step is preferably performed with each copy of the gene 
being placed in separate containers. However, it is also envisioned that if the two copies are 
■ labelled witii different tags, or are otherwise separately distinguishable or identifiable, it 
could be possible in some cases to perform the method in the same container^ For example, 
if jfirst and second copies of the gene are labelled with different first and second fluorescent 
dyes, respectively, and an allele-specific oligonucleotide labelled with yet a third different 
fluorescent dye is used to assay the polymorphic site(s), then detecting a combination of the 
first and third dyes would identify the polymorphism in the first gene copy while detecting 
a combination of the second and third dyes would identify the polymorphism in the second 
gene copy, 

[0038] la both tiie genotyping and haplotyping methods, the identity of a nucleotide (or 
nucleotide pair) at a polymorphic site(s) may be determined by amplifying a target 
region(s) containing the polymorphic site(s) directly firom one or both copies of the gene, or 
firagment th^eo^ and the sequence of the amplified region(s) detmnined by conventional 
methods. It will be readily appreciated by the skilled artisan that only one nucleotide will be 
detected at a polymorphic site in individuals who are homozygous at that site, while two 
different nucleotides will be detected if the individual is heterozygous for that site. The 
polymorphism may be identified directly, known as positive-type identification, or by 
inference, referred to as negative-type identification. For example, where a SNP is known 
to be guanine and cytosine in a reference population, a site may be positively determined to 
be either guanine or cytosine for ail individual homozygous at that site, or both guanine and 
cytosine, if the individual is heterozygous at that site. Alternatively, the site may be 
negatively determined to be not guanine (and thus cytosine/cytosine) or not cytosine (and 
tiius guanine/guanine). 

[0039] In addition, the identity of tixe allele(s) present at any of the novel polymorphic 
sites described herein may be indirectly determined by genotyping a polymorphic site not 
disclosed herein that is in linkage disequilibrium with the polymorphic site that is of 
interest. Two sites are said to be in linkage disequilibrium if the presence of a particular 
variant at one site enhances the predictability of another variant at the second site (see, 
Stevens JC, Mol Diag 4:309-317 (1999)). Polymorphic sites in linkage disequilibrium with 
the presently disclosed polymorphic sites may be located in regions of the gene or in other 
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gefiomic regions not examined herein. Genotyping of a polymorphic site in linkage 
disequilibrium with the novel polymorphic sites described herein may be performed by, but . 
is not limited to, any of the above-mentioned methods for detecting the identity of the allele 
at a polymorphic site. 

[0040] The target region(s) may be amplified using any oligonucleotide-directed 
amplification method, including but not limited to polymerase chain reaction (PGR) (U.S. 
Pat. No. 4,965,188), ligase chain reaction (LCR) (Barany et ah, Proc Natl Acad Sci USA 
88:189-193 (1991); PCT patent application WO 90/01069), and oligonucleotide ligation 
assay (OLA) (Landegren et al, Science 241:1077-1080 (1988)). Oligonucleotides usefiil as 
primers or probes in such methods should specifically hybridize to a region of the nucleic 
acid that contains or is adjacent to the polymorphic site. Typically, the oligonucleotides are 
between 10 and 35 nucleotides in length and preferably, between IS and 30 nucleotides in 
length. Most preferably, the oligonucleotides are 20 to 25 nucleotides long. The exact 
length of the oligonucleotide will depend on many factors that are routinely considered and 
practiced by the skilled artisan. 

[0041] Other known nucleic acid amplification procedures may be used to amplify the 
target region including transcription-based amplification systems (U.S. Pat. No. 5,130,238; 
EP 329,822; U.S. Pat. No. 5,169,766, WO 89/06700) and isothemial methods (Walker et 

al., Proc Natl Acad Sci USA 89:392-396 (1992)). 

[0042] A polymorphism in the target region may also be assayed before or after 
amplification using one of several hybridization-based methods known in the art. Typically, 
allele-specific oligonucleotides are utilized in performing such methods. The allele-specific 
oligonucleotides may be used as differently labelled probe pairs, with one member of the 
pair showing a perfect match to one variant of a target sequence and the other member 
showing a perfect match to a different variant. In some embodiments, more than one 
polymorphic site maybe detected at once using a set of allele-specific oligonucleotides or 
oligonucleotide pairs. Preferably, the members of the set have melting temperatures within 
5*^0 and more preferably within 2^C, of each other when hybridizing to each of liie 
polymorphic sites being detected. 

[0043] Hybridization of an allele-specific oligonucleotide to a target polynucleotide 
may be performed with both entities in solution or such hybridization may be performed 
when either the oUgonucleotide or the target polynucleotide is covalently or noncovalently 
affixed to a soUd support. Attachment may be mediated, for example, by antibody-antigen 
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interactions, poly-L-Lys, streptavidin or avidin-biotin, salt bridges, hydrophobic 
intCTactions, chemical linkages, UV cross-linking baking, etc. Allele-specific 
oligonucleotides may be synthesized directly on the solid support or attached to llie solid 
support subsequent to synthesis. Solid-supports suitable for use in detection methods of the 
invention include substrates made of silicon, glass, plastic, paper and die like, which may 
be foraied, for example, into wells (as in 96-well plates), sUdes, sheets, m^branes, fibers, 
chips, dishes, and beads. The soUd support maybe treated, coated or derivatized to 
facilitate the immobilization of the allele-specific oligonucleotide or target nucleic acid. 
[0044] The genotype or haplotype for the gene of an individual may also be determined 
by hybridization of a nucleic sample containing one or both copies of the gene to nucleic 
acid arrays and subarrays such as described in WO 95/1 1995. The arrays would contain a 
battery of allele-specific oligonucleotides representing each of the polymorphic sites to be 
included in the genotype or haplotype. 

[0045] The identity of polymorphisms may also be determined using a mismatch 
detection technique, including but not limited to the RNase protection method using 
riboprobes (Winter et al, Proc Natl Acad Sci USA 82:7575 (1985); Meyers et al. Science 
230:1242 (1985)) and proteins which recognize nucleotide mismatches, such as the E. coli 
mutS protein (Modrich P. Ann Rev Genet 25:229-253 (1991)). Alternatively, variant alleles 
can be identified by single strand conformation polymorphism (SSCP) analysis (Orita et aL, 
Genomics 5:874-879 (1989); Humphries et aL, in Molecular Diagnosis of Genetic 
Diseases, R. Elles, ed., pp. 321-340 (1996)) or denaturing gradient gel electrophoresis 
(DGGE) (Wartell et al, Nucl Acids Res 18:2699-2706 (1990); ShefiBeld et al, Proc Natl 
Acad Sci USA 86:232-236 (1989)). 

[0046] A polymerase-mediated primer extension method may also be used to identify 
the polymozphism(s). Several such methods have been described in the patent and scientific 
literature and include the "Genetic Bit Analysis" method (WO 92/15712) and the ligase / 
polymerase mediated genetic bit analysis (U.S. Pat. No. 5,679,524). Related methods are 
disclosed in WO 91/02087, WO 90/09455, WO 95/17676, U.S. Pat. Nos. 5,302,509 and 
5,945,283. Extended primers containing a polymorphism may be detected by mass 
spectrometry as described in U.S. Pat No. 5,605,798. Another primer extension method is 
allele-specific PGR (Ruafio et al, Nucl Acids Res 17:8392 (1989); Ruafio et al, Nucl Acids 
Res 19:6877-6882 (1991); WO 93/22456; Turki etal, J Clin Invest 95:1635-1641 (1995)). 
Li addition, multiple polymorphic sites may be investigated by simultaneously ampUfying 
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multiple regions of the nucleic acid using sets of allele-specific primers as described in 
WO 89/10414. 

[0047] La a preferred embodiment, the haplotype frequency data for each 
ethnogeographic group is examined to determine whether it is consistent with Hardy- 
Weiriberg equilibrium. Hardy-Weinberg equilibrium (D.L. Hartl et al. Principles of 
Population Genomics, 3rd Ed. (Sinauer Associates, Sunderland, MA, 1997) postulates that 
the frequency of finding the haplotype pair H\/H2 is equal to Ph-w {H1/H2) = 2p{Hx) p (H2) 
if Hi 9^H2 and Ph^w (H1/H2) = p (Hi) p (H2) if Hi = H2. A statistically significant difference 
between the observed and expected haplotype frequencies could be due to one or more 
factors including significant inbreeding in the population group, strong selective pressure 
on the gene, sampling bias, and/or errors in the genotyping process. If large deviations from 
Hardy-Weinberg equilibrium are obs^ed in an ethnogeographic group, flie number of 
individuals in that group can be increased to see if the deviation is due to a sampling bias. If 
a larger sample size does not reduce the difference between observed and expected 
haplotype pair frequencies, then one may wish to consider h^lotyping the individual using 
a direct haplotyping method such as, for example, CLASPER System™ technology (U.S. 
Pat No. 5,866,404), SMD, or allele-specific long-range PGR (Michalotos-Beloin et al, 
Nucl Acids Res 24:4841-4843 (1996)). 

[0048] In one embodiment of this metibiod for predicting a haplotype pair, the assigning 
step involves performing the following analysis. First, each of the possible haplotype pairs 
is compared to the haplotype pairs in the reference population. Generally, only one of the 
haplotype pairs in the reference population matches a possible haplotype pair and that pair 
is assigned to the individual. Occasionally, only one haplotype represented in the reference 
haplotype pairs is consistent with a possible haplotype pair for an individual, and in such 
cases the individual is assigned a haplotype pair containing this known haplotype and a new 
haplotype derived by subtracting the known haplotype from the possible haplotype pair. In 
rare cases, either no haplotypes in the reference population are consistent with the possible 
haplotype pairs, or alternatively, multiple reference haplotype pairs are consistent witti the 
possible haplotype pairs. In such cases, the individual is preferably haplotyped using a 
direct molecular haplotyping method such as, for example, CLASPER System"™ 
technology (U.S. Pat, No. 5,866,404), SMD, or allele-specific long-range PGR 
(Michalotos-Belom et al, Nucl Acids Res 24:4841-4843 (1996)). 
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[0049] The invention also provides a method for determining the frequency of a 
genotype or haplotype in a population. The method comprises determining the genotype or 
the haplotype pair for the gene that is present in each member of the population, vs^herein 
Ihe genotype or haplotype comprises the nucleotide pair or nucleotide detected at one or 
more of the polymorphic sites in the gene, and calculating the frequency any particular 
genotype or haplotype is found in the population. The population may be a reference 
population, a family population, a same sex population, a population group, a trait 
population (e.g^., a group of individuals exhibiting a trait of interest such as a medical 
condition or response to a therapeutic treatment). 

[0050] In another aspect of the invention, frequency data for genotypes and/or 
haplotypes found ia a reference population are used in a method for identifying an 
association between a trait and a genotype or a haplotype. The trait may be any detectable 
phenotype, including but not limited to susceptibility to a disease or response to a 
treatment. The method involves obtaining data on the frequency of the genotype(s) or 
haplotype(s) of iaterest in a reference population as well as in a population exhibiting the 
trait. Frequency data for one or both of the reference and trait populations may be obtained 
by genotyping or haplotyping each individual in the populations usiag one of the methods 
described above. The haplotypes for the trait population may be determined directly or, 
altematively, by the predictive genotype to haplotype approach described above. 
[005 1] In another embodiment, the frequency data for the reference and/or trait 
populations is obtained by accessing previously determined frequency data, which may be 
in written or electronic form. For example, the frequency data may be present in a database 
that is accessible by a computer. Once the frequency data is obtained, the frequencies of the 
genotype(s) or haplotype(s) of interest in the reference and trait populations are compared. 
In a preferred embodiment, the frequencies of all genotypes and/or haplotypes observed in 
the populations are compared. If a particular genotype or haplotype for tiie gene is more 
frequent in the trait population tiian in the reference population at a statistically significant 
amount, then the trait is predicted to be associated with that genotype or h£Q)lotype. 
[0052] In a preferred embodiment statistical analysis is performed by the use of 
standard ANOVA tests witii a Bonferoni correction and/or a bootstrapping method that 
simulates the genotype phenotype correlation many times and calculates a significance 
value. When many polymorphisms are being analyzed a correction to factor maybe 
performed to correct for a significant association that might be found by chance. For 
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Statistical methods for use in the methods of this invention, see: Statistical Methods in 
Biology, 3'^ edition^ Bailey NTJ, (Cambridge Univ. Press, 1997); Introduction to 
Computational Biology^ Watemian MS (CRC Press, 2000) and Bioinformatics^ Baxevanis 
AD & Ouellette BFF editors (John Wiley & Sons, Inc., 2001), 
[0053] hi a preferred embodiment of the method, the trait of interest is a clinical 
response exhibited by a patient to some therapeutic treatment, for example, response to a 
drug targeting or response to a therapeutic treatment for a medical condition. 
[0054] In another embodiment of the invention, a detectable genotype or haplotype that 
is in linkage disequilibrium with the genotype or haplotype of interest may be used as a 
surrogate marker. A genotype that is in linkage disequilibrium with a genotype may be 
discovered by determining if a particular genotype or haplotype for the gene is more 
frequent in the population that also demonstrates the potential surrogate marker genotype 
than in the reference population at a statistically significant amount, then the marker 
genotype is predicted to be associated with that genotype or haplotype and tiien can be used 
as a surrogate marker in place of the genotype. 

[0055] Definitions. As used herein, '^medical condition" includes but is not lin:iited to 
any condition or disease manifested as one or more physical and/or psychological 
symptoms for which treatment is desirable, and includes previously and newly identified 
diseases and other disorders. 

[0056] As used herein, the term "clinical response" means any or all of the following: a 
quantitative measure of the response, no response, and adverse response (i.e., side eflfects). 
[0057] In order to deduce a correlation between clinical response to a treatment and a 
genotype or haplotype, data is obtained on the cUnical responses exhibited by a population 
of individuals who received the treatment, hereinafter the "clinical population". This 
clinical data may be obtained by analyzing the residts of a clinical trial that has already 
been run and/or the clinical data may be obtained by designing and carrying out one or 
more new clinical trials. 

[0058] As used herein, tiie term "clinical trial" means any research study designed to 
collect clinical data on responses to a particular treatment, and mcludes but is not limited to 
phase I, phase II and phase m clinical trials. Standard methods are used to define the 
patient population and to enroll subjects. 

[0059] It is preferred that the individuals included in the clinical population have been 
graded for the existence of the medical condition of interest. This grading of potential 
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patients could employ a standard physical exam or one or more lab tests. Alternatively, 
grading of patients could use haplotyping for situations where there is a strotfg correlation 
between haplotype pair and disease susceptibiUty or severity. 

[0060] The therapeutic treatment of interest is administered to each individual in the 
trial population and each individual's response to the treatment is measured using one or 
more predetermined criteria. It is contemplated that in many cases, the trial population will 
exhibit a range of responses and that the investigator will choose the number of responder 
groups (e.g.^ low, medium, high) made up by the various responses. In addition, the gene 
for each individual in the trial population is genotyped and/or haplotyped, which may be 
done before or after administering the treatment. 

[0061] After both the clinical and polymorphism data have been obtained, correlations 
between individual response and genotype or haplotype content are created. Correlations 
may be produced in several ways. In one method, individuals are grouped by their genotype 
or haplotype (or haplotype pair) (also referred to as a polymorphism group), and then the 
averages and standard deviations of clinical responses exhibited by the members of each 
polymorphism groiq) are calculated. 

[0062] These results are then analyzed to detenmne if any observed variation in 
clinical response between polymorphism groups is statistically significant Statistical 
analysis methods which may be used are described in L.D. Fisher & G. vaiiBeUe, 
Biostatistics: A Methodology for the Health Sciences (Wiley-lnterscience, New York, 
1993). This analysis may also include a regression calculation of which polymorphic sites 
in the gene give the most significant contribution to the differences in phenotype. 
[0063] A second melhod for finding correlations between h^lotype content and 
clinical responses uses predictive models based on enror-naiimiiizing optimization 
algorithms. One of many possible optimization algorithms is a genetic algorithm (R. 
Judson, "Genetic Algorithms and Their Uses in Chemistry" in Reviews in Computational 
Chemistry, Vol. 10, pp. 1- 73, K.B. Lipkowitz & D.B. Boyd, eds. (VCH Publishers, New 
York, 1997). Simulated annealing (Press et aL^ *lslumerical Recipes in C: The Ait of 
Scientific Computing", Cambridge University Press (Cambridge) 1992, Ch. 10), neural 
networks (E. Rich and K. Knight, "Artificial Intelligence", 2nd Edition (McGraw-Hill, New 
York, 1991, Ch. 18), standard gradient descent methods (Press et ah, supra Ch. 10), or 
other global or local optimization approaches (see discussion in Judson, supra) could also 
be used. 

18 



wo 2005/040415 



PCT/EP2004/011123 



[0064] Correlations may also be analyzed using analysis of variation (ANOVA) 
techniques to determine how much of the variation in the clinical data is explained by 
different subsets of the polymorphic sites in the gene. ANOVA is used to test hypotheses 
about whether a response variable is caused by or correlated with one or more traits or 
variables that can be measured (Fisher & vanBelle, supra, Ch. 10). 
[0065] From the analyses described above, a mathematical model may be readily 
constructed by the skilled artisan that predicts clinical response as a function of genotype or 
haplotype content 

[0066] The identification of an association between a clinical response and a genotype 
or haplotype (or haplotype pair) for the gene may be the basis for designing a diagnostic 
method to determine those individuals who will or will not re^ond to the treatment, or 
alternatively, will respond at a lower level and thus may require more treatment, 2.e,, a 
greater dose of a drug. The diagnostic method may take one of several forms: for example, 
a direct DNA test (i.e., genotyping or h^lotyping one or more of the polymorphic sites in 
the gene), a serological test, or a physical exam measurement. The only requirement is that 
there be a good correlation between the diagnostic test results and the underlying genotype 
or haplotype that is in turn correlated with the clinical response. In a preferred embodiment, 
this diagnostic method uses the predictive haplotyping method described above. 
[0067] A computer may implement any or all analytical and mathematical operations 
involved in practicing the methods of the present inv^tion. In addition, the computer may 
execute a program that generates views (or screens) displayed on a display device and with 
which the user can interact to view and analyze large amotmts of information relating to the 
gene and its genomic variation, including chromosome location, gene stracture, and gene 
family, gene e>q)ression data, polymorphism data, genetic sequence data, and clinical data 
population data (e.g., data on ethnogeographic origin, clinical responses, genotypes, and 
haplotypes for one or more populations). The polymorphism data described herein may be 
stored as part of a relational database (e.g., an instance of an Oracle database or a set of 
ASCn flat files). These polymorphism data may be stored on the computer's hard drive or 
may, for example, be stored on a CD-ROM or on one or more other storage devices 
accessible by the compute. For example, the data may be stored on one or more databases 
in communication with the computer via a network. 

[0068] In other embodiments, the invention provides methods, compositions, and kits 
for haplotyping and/or genotyping the gene in an individual. The compositions contain 
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oligonucleotide probes and primers designed to specifically hybridize to one or more target 
regions containing, or that are adjacent to, a polymorphic site. The methods and 
compositions for establishing the genotype or haplotype of an individual at the novel 
polymorphic sites described herein are useful for studying the effect of the polymorphisms 
in the etiology of diseases affected by the expression and function of the protein, studying 
the efficacy of drugs targeting , predicting individual susceptibility to diseases affected by 
the expression and function of the protein and predicting individual responsiveness to drugs 
targeting the gene product. 

[0069] In yet another embodiment, the invention provides a method for identifying an 
association between a genotype or haplotype and a trait In preferred embodiments, the trait 
is susceptibility to a disease, severity of a disease, the staging of a disease or response to a 
drug. Such methods have appUcabiUty in developing diagnostic tests and therapeutic 
treatments for all phaimacogenetic {^plications where there is the potential for an 
association between a genotype and a treatmmt outcome including efficacy measurements, 
PK measurements and side effect measurements. 

[0070] The invention also provides a computer system for storing and displaying 
polymorphism data determined for the gene; The compute system comprises a computer 
processing unit; a display; and a database containing the polymorphism data. The 
polymorphism data includes the polymorphisms, the genotypes and tiie haplotypes 
identified for the gene in a reference population. In a preferred embodimCTt, tiie computer 
system is capable of producing a display showing haplotypes organized according to their 
evolutionary relationships. 

[0071] Tn another aspect, the iavention provides SNP probes, which are useful in 
classifying people according to their types of genetic variation. The SNP probes according 
to the invention are oUgonucleotides, which can discriminate between alleles of a SNP 
nucleic acid in conventional allelic discrindnation assays. 

[0072] As used herein, a "SNP nucleic acid" is a nucleic acid sequence, which 
comprises a nucleotide that is variable within an otherwise identical nucleotide sequence 
between individuals or groups of individuals, thus, existing as alleles. Such SNP nucleic 
acids are preferably firom about 15 to about 500 nucleotides in length. The SNP nucleic 
acids may be part of a chromosome, or they may be an exact copy of a part of a 
chromosome, e.g., by amplification of such a part of a chromosome through PGR or 
through cloning. The SNP nucleic acids are referred to hereafter simply as "SNPs". The 
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SNP probes according to tiie invention are oligonucleotides that are complementary to a 
SNP nucleic acid. 

[0073] As used herein, the term "complementary" means ^actly complementary 
throughout the length of the oligonucleotide in the Watson and Crick sense of the word. 
[0074] In certain preferred embodiments, the oligonucleotides according to this aspect 
of the invention are complemCTrtary to one allele of the SNP nucleic acid, but not to any 
other allele of the SNP nucleic acid. Oligonucleotides according to this embodiment of the 
invention can discriminate between alleles of the SNP nucleic acid in various ways. For 
example, under stringent hybridization conditions, an oligonucleotide of appropriate length 
will hybridize to one allele of the SNP nucleic acid, but not to any oflaer allele of the SNP 
nucleic acid. The oligonucleotide may be labelled by a radiolabel or a fluorescent label. 
Alternatively, an oligonucleotide of appropriate length can be used as a primer for PGR, 
wherein the 3' temiinal nucleotide is coniplementary to one allele of the SNP nucleic acid, 
but not to any other allele. In this embodiment, the presence or absence of amplification by 
PGR determines the haplotype of the SNP nucleic acid. 

[0075] Genomic and cDNA fragments of the invention comprise at least one novel 
polymorphic site identified herein and have a length of at least 10 nucleotides and may 
range up to the full length of the gene. Preferably, a fragment according to tiie present 
invention is between 100 and 3000 nucleotides in length, and more preferably between 200 
and 2000 nucleotides in length, and most preferably between 500 and 1000 nucleotides in 
length. 

[0076] In describing the polymorphic sites identified herein reference is made to the 
sense strand of the gene for convenience. However, as recognized by the skilled artisan, 
nucleic acid molecules containing the gene may be complementary double stranded 
molecules and thus ref^ence to a particular site on the sense strand refers as well to the 
corresponding site on the complementary antisense strand. Thus, reference may be made to 
the same polymorphic site on eitiier strand aixd an oligonucleotide may be designed to 
hybridize specifically to either strand at a target region containing the polymorphic site. 
Thus, the invention also includes single-stranded polynucleotides that are complementary to 
the sense strand of the genomic variants described herein. 

[0077] In a preferred embodiment, such kit may further comprise a DNA sample 
collecting means. 



21 



wo 2005/040415 PCT/EP2004/011123 

[0078] In particular, the genotyping primer composition may comprise at least two sets 
of allele specific primer pairs. Preferably, the two genotyping oligonucleotides are 
packaged in separate containers. 

[0079] It is to be understood that the methods of the iavention described herein 
generally may further comprise the use of a kit according to the invention. Generally, the 
methods of the invention maybe performed ex-vzvo, and such ex-vivo methods are 
specifically contemplated by the present invention. Also, where a method of the invention 
may include steps that may be practiced on the human or animal body, methods that only 
comprise those steps which are not practiced on the human or animal body are specifically 
contemplated by the present invention. 

[0080] Efifect(s) of the polymorphisms identified herein on expression of may be 
investigated by preparing recombinant cells and/or organisms, preferably recombinant 
animals, containing a polymorphic variant of the gene. As used herdn, "expression" 
includes but is not limited to one or more of the following: transcription of the gene into 
precursor mKNA^ splicing and other processing of the precursor niRNA to produce mature 
mKNA; mRNA stabilit>^ translation of the mature mRNA into protein (including codon 
usage and tRNA availability); and glycosylation and/or other modifications of the 
translation product, if required for proper expression and fimction. 
[0081] To prepare a recombinant cell of the invention, the desired isogene may be 
introduced into the cell in a vector such that the isogene remains extrachromosomal. In such 
a situation, the gene will be expressed by the cell firom the extrachromosomal location. In a 
preferred embodiment, the isogene is introduced into a cell in such a way that it recombines 
with the endogenous gene present in the cell. Such recombination requires the occurrence 
of a double recombination event, thereby residting in the desired gene polymorphism. 
Vectors for the introduction of genes both for recombination and for extrachromosomal 
maintenance are known in the art, and any suitable vector or vector construct may be used 
in the invention. Methods such as electroporation, particle bombardment, calcium 
phosphate co-precipitation and viral transduction for introducing DNA into cells are known 
in the art; therefore, the choice of method may lie with the competence and preference of 
the skilled practitioner. 

[0082] Recombinant organisms, i.e., transgenic animals, expressing a variant gene are 
prepared using standard procedures known in the art. Preferably, a construct comprising the 
variant gene is introduced into a nonhuman animal or an ancestor of the animal at an 
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embryonic stage, i.e., the one-cell stage, or generally not later than about the eigiht-cell 
stage. Transg^c animals" carrying the constructs of the invention can be made by several 
methods known to those having skill in the art. One method involves transfecting into the 
embryo a retrovirus constructed to contain one or more insulator elements, a gene or genes 
of interest, and other components known to those skilled m tiie art to provide a complete 
shuttle vector harbouring the insulated gene(s) as a transgene, see e.g.^ U.S. Pat. No. 
5,610,053. Another method involves directly injecting a transgene into the embryo. A third 
method involves the use of embryonic stem cells. 

[0083] Examples of animals, into which the isogenes may be introduced include, but 
are not limited to, mice, rats, other rodents, and nonhuman primates (see "The Introduction 
of Foreign Genes into Mice" and the cited references therein. In: Recombinant DNA, Eds. J 
.D. Watson, M. Gilman, J. Witkowski, & M. Zoller; W JI. FreCTaan and Company, New 
York, pages 254-272). Transgenic animals stably expressing a human isogene and 
pit)ducing human protein can be used as biological models for studying diseases related to 
abnormal expression and/or activity, and for screening and assaying various candidate 
dmgs, compounds, and treatment regimens to reduce the symptoms or effects of these 
diseases. 

[0084] In practicing the present invention, many conventional techniques in molecular 
biology, microbiology and recombinant DNA are used. These techniques are well-known 
and are explained in, e.g., ^'Current Protocols in Molecular Biology", Vols. I-III, Ausubel, 
Ed. (1997); Sambrook et al, ''Molecular Cloning: A Laboratory Manual'', 2"*^ Ed., Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1989); "DNA Cloning: A 
Practical Approach", Vols. I and n. Glover, Ed. (1985); "Oligonucleotide Synthesis", Gait, 
Ed. (1984); ''Nucleic Acid Hybridization",lizmes&mg^, Eds. (1985); "Transcription 
and Translation", Hames & Higgins, Eds. (1984); "Animal Cell Culture", Freshney, Ed. 
(1986); "Immobilized Cells and Enzymes", JKL Press (1986); Perbal, "A Practical Guide to 
Molecular Cloning"; the series. Methods in Enzymol., Academic Press, Lie. (1984); "Gene 
Transfer Vectors for Mammalian Cells", Miller and Calos, Eds., Cold Spring Harbor 
Laboratory, NY (1987); sud Methods in Enzymology, Vols. 154 and 155, Wu & Grossman, 
and Wu, Eds,, respectively. 

[0085] The standard control levels of the gene expression product, thus determined in 
the different control groups, would then be compared with the measured level of an gene 
expression product in a given patient. This gene expression product could be the 
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characteristic mRNA associated with that particular genotype group or the polypeptide gene 
expression product of that genotype group. The patient could then be classified or assigned 
to a particular genotype group based on how similar the measured levels were compared to 
the control levels for a given group. 

[0086] As one of skill in the art will und^tand, there will be a certain degree of 
uncertainty involved in making this deterniination. Therefore, the standard deviations of the 
control group levels would be used to make a probabilistic determination and the methods 
of this invention would be applicable over a wide range of probabiUty based genotype 
group determinations. Thus, for example and not by way of limitation, in one embodiment, 
if the measured level of the gene expression product falls within 2.5 standard deviations of 
the mean of any of the control groups, then that individual may be assigned to that genotype 
group. In another embodiment if the measured level of the gene expression product falls 
within 2.0 standard deviations of the mean of any of the control groups then that individual 
may be assigned to that genotype group. In still another embodiment, if the measured level 
of the gene expression product falls within 1 .5 standard deviations of the mean of any of the 
control groups then that individual may be assigned to that genotype group. In yet anotiier 
embodiment, if the measured level of the gene expression product is 1.0 or less standard 
deviations of the mean of any of the control groups levels then that individual may be 
assigned to that genotype group. 

[0087] Thus this process will allow the determining, with various degrees of 
probability, which group a specific patient should be place in and such assignment to a 
genotype group would then determine the risk category into which the individual should be 
placed. 

[0088] Methods to detect and measure mRNA levels and levels of polypeptide gene 
expression products are well known in the art and include the use of nucleotide microarrays 
and polypeptide detection methods involving mass spectrometers and/or antibody detection 
and quantification techniques. See also. Human Molecular Genetics^ 2^^ Edition. Tom 
Strachan & Andrew, Read (John Wiley and Sons, Inc. Publication, NY, 1999). 
[0089] Furthermore, detection of the concentration of the polypeptide (protein) 
e3q)ression product of the gene in body fluids or tissues can be used to determine the 
presence or absence of the polymorphism, and the relative level of the polypeptide 
expression product can be used to determine if the polymorphism is present in a 
homozygous or heterozygous state and therefore the risk category of the individual. 
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[0090] As used herein, "medical condition" includes, but is not limited to, any 
condition or disease manifested as one or more physical and/or psychological symptoms for 
which treatment is desirable, and includes previously and newly-identified diseases and 
other disorders. 

[0091] As used herein the term "polymorphism" shall mean any sequence variant 
present at a firequency of >1% in a population. The sequence variant may be present at a 
frequency significantly greater than 1% such as 5% or 10 % or more. Also, the term may be 
used to refer to the sequence variation observed in an individual at a polymorphic site. 
Polymorphisms include nucleotide substitutions, insertions, deletions and microsatellites 
and may, but need not, result in detectable differences in gene expression or protein 
function. 

[0092] As used herein, the term "clinical response" means any or all of the following: a 
quantitative measure of the response, no response and adverse response, ue.^ side effects. 
[0093] As used herein the term "allele" shall mean a particular form of a gene or DNA 
sequence at a specific chromosomal location (locus). 

[0094] As used herein, the temi "genotype" shall mean an unphased 5' to 3* sequence 
of nucleotide pair(s) found at one or more polymorphic sites in a locus on a pair of 
homologous chromosomes in an individual. As used herein, genotype includes a full- 
genotype and/or a sub-genotype. 

[0095] As used herein, the term "polynucleotide" shall mean any RNA or DNA, which 
may be unmodified or modified RNA or DNA, Polynucleotides include, without limitation, 
single- and double-stranded DNA, DNA that is a mixture of single- and double-stranded 
regions, single- and double-stranded RNA, and RNA that is mixture of single- and double- 
stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded 
or, more typically, double-stranded or a mixture of single- and double-stranded regions. In 
addition, polynucleotide refers to triple-stranded regions comprising RNA or DNA or both 
RNA and DNA. The term polynucleotide also includes DNAs or RNAs containing one or 
more modified bases and DNAs or RNAs with backbones modified for stability or for other 
reasons. • 

[0096] As used herein the term "gene" shall mean a segment of DNA that contains all 
the information for the regulated biosynthesis of an RNA product, including promoters, 
exons, introns, and other imtranslated regions that control expression. 
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[0097] As used herein the term "polypeptide" shall mean any polypeptide comprising 
two or more amino acids joined to each other by peptide bonds or modified peptide bonds, 
i.e., peptide isosteres. Polypeptide refers to both short chains, commonly referred to as 
peptides, glycopeptides or oligomers, a^d to longer chains, generally referred to as proteins. 
Polypeptides may contain amino acids other than the 20 gene-eacoded amino acids. 
Polypeptides include amino acid sequences modified either by natural processes, such as 
post-translational processing, or by chemical modification techniques that are well known 
in the art. Such modifications are well described in basic texts and in more detailed 
monographs, as well as in a voluminous research litCTature. 

[0098] As used herein, the term "polymorphic site" shall mean a position within a 
locus at which at least two alternative sequences are found in a population, the most 
firequent of which has a firequency of no more than 99%. 

[0099] As used herein, the term 'Nucleotide pair" shall mean the nucleotides found at a 
polymorphic site on the two copies of a chromosome firom an individual. 
[00100] As used herein, the term 'phased" means, when applied to a sequence of 
nucleotide pairs for two or more polymotphic sites in a locus, the combination of 
nucleotides present at those polymorphic sites on a single copy of the locus is known. 
[00101] In order to deduce a correlation between clinical response to a treatment and a 
genotype or haplotype, it is necessary to obtain data on the clinical responses exhibited by a 
population of individuals who received the treatment, hereinafter the "cUmcal population". 
This clinical data may be obtained by analyzing the results of a clinical trial that has already 
been run and/or the clinical data may be obtained by designing and carrying out one or 
more new clinical trials. 

[00 102] As used herein, the term "clinical trial" means any research study designed to 
collect clinical data on responses to a particular treatment, and includes, but is not limited 
to, Phase I, n and m clinical trials. Standard methods are used to define the patient 
population and to enroll subjects. 

[00103] As used herein the term "locus" shall mean a location on a chromosome or 
DNA molecule corresponding to a gene or a physical or phenotypic feature. 
[00104] The therapeutic treatment of interest is administered to each individual in the 
trial population and each individuaVs response to the treatment is measmred using one or 
more predetermined criteria. It is contemplated that in many cases, the trial population will 
exhibit a range of responses and that the investigator will choose the number of responder 
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groups, e.g., low, medium and higji, made up by the various responses. Jn addition, the gene 
for each individual in the trial population is genotyped and/or haplotyped, which may be 
done before or after administering the treatment. 

[00105] ITie Detection of Nucleic Acids and Proteins as Markers. Jn a particular 
embodiment, the level of mRNA corresponding to the marker can be determined both by in 
situ and by in vitro formats in a biological sample using methods known in the art. The 
term "biological sample" is intended to include tissues, cells, biological fluids and isolates 
thereof, isolated from a subject, as well as tissues, cells and fluids present within a subject. 
Many expression detection methods use isolated RNA. For in vitro methods, any RNA 
isolation technique that does not select against the isolation of mRNA can be utilized for 
the purification of RNA from cells. See, e.g,, Ausubel et aL, Ed., Curr. Prot. MoL Biol^ 
John Wiley & Sons, NY (1987-1999). Additionally, large numbers of tissue samples can 
readily be processed using techniques well-known to those of skill in the art, such as, e.g., 
the single-stq> RNA isolation process of U.S. Pat. No. 4,843,155. 
[001 06] The isolated mRNA can be used in hybridization or ampUfication assays that 
include, but are not limited to. Southern or Northern analyses, PGR analyses and probe 
arrays. One preferred diagnostic method for the detection of mRNA levels involve 
contacting the isolated niRNA with a nucleic acid molecule Oprobe) that can hybridize to 
the mRNA encoded by the gene being detected. The nucleic acid probe can be, e.g., a ftill- 
length cDNA, or a portion thereof, such as an oligonucleotide of at least 7, 15, 30, 50, 100, 
250 or 500 nucleotides in length and sufficient to specifically hybridize imder stringent 
conditions to a mRNA or genomic DNA encoding a marker of the present invention. Other 
suitable probes for use in the diagnostic assays of the invention are described herein. 
Hybridization of an mRNA with the probe indicates that the marker in question is being 
e:qpressed. 

[00107] In one format, the mRNA is immobilized on a solid surfiice and contacted with 
a probe, for example, by running the isolated mRNA on an agarose gel and transferring the 
mRNA from the gel to a membrane, such as nitrocellulose, hi an alternative format, the 
probe(s) are immobilized on a solid surface and the mRNA is contacted with the probe(s), 
for example, in an Affymetrix gene chip array. A skilled artisan can readily adapt known 
mRNA detection methods for use in detecting the level of mRNA encoded by the markers 
of the present invention. 
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[00108] An altOTiative method for detennining the level of mKNA corresponding to a 
marker of tiie present invention in a sample involves the process of nucleic acid 
amplification, e.g., by RT-PCR (the experimental embodiment set forth in Mullis, U.S. Pat. 
No,.4,683,202 (1987); ligase chain reaction, Barany (1991), supra; self-sustained sequence 
replication, Guatelli et aL, Proc. Natl Acad. Set USA, Vol. 87, pp. 1874-1878 (1990); 
transcriptional amplification system, Kwoh et al, Proc. Natl Acad. Set USA, Vol. 86, pp. 
1 173-1 177 (1989); Q-Beta Replicase, Lizardi et al, Biol Technology, Vol. 6, p. 1 197 
(1988); rolling circle replication, U.S. Pat. No. 5,854,033 (1988); or any other nucleic acid 
ampUfication method, followed by the detection of the amplified molecules using 
techniques well-known to those of skill in the art. These detection schemes are especially 
useful for the detection of the nucleic acid molecules if such molecules are present in very 
low numbers. As used herein, amplification primers are defined as being a pair of nucleic 
acid molecides that can aimeal to 5* or 3' regions of a gene ^lus and minus strands, 
respectively, or vice-versa) and contain a short region in between. In general, amplification 
primers are firom about 10-30 nucleotides in length and fiank a region &om about SO-200 
nucleotides in length. Under qspropriate conditions and with appropriate reagents, such 
primers permit the amplification of a nucleic acid molecule comprising the nucleotide 
sequence fianked by the primers. 

[00109] For in situ methods, mRNA does not need to be isolated form fiie cells prior to 
detection. In such methods, a cell or tissue sample is prepared/processed using known 
histological methods. The sample is then immobilized on a support, typically a glass slide, 
and then contacted with a probe that can hybridize to mRNA that encodes the marker. 
[001 10] As an altemative to making determinations based on the absolute expression 
level of the marker, determinations may be based on the normalized expression level of the 
marker. Expression levels are normalized by correcting the absolute expression level of a 
marker by comparing its expression to the expression of a gene that is not a marker, e.g., a 
housekeeping gene that is constitutively expressed. Suitable genes for normalization 
include housekeeping genes, such as the actin gene or epitiieUal cell-specific genes. This 
normalization allows the comparison of the expression level in one sample, e.g., a patient 
sample, to another sample or between samples firom difierent sources. 
[001 1 1 ] Alternatively, the expression level can be provided as a relative expression 
level. To determine a relative expression level of a marker, the level of expression of the 
marker is determined for 10 or more samples of normal versus disease biological samples, 
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preferably 50 or more samples, prior to the determiiiation of the expression level for the 
sample in question. The mean expression level of each of the genes assayed in the largo: 
number of samples is detomined and tiiis is used as a baseline expression level for the 
marker. The expression level of the marker determined for the test sample (absolute level of 
expression) is then divided by the mean expression value obtained for that marker. This 
provides a relative expression level. 

[001 12] Preferably, the samples used in the baseline determination will be from patients 
who do not have the polymorphism. The choice of the cell source is dependent on the use 
of the relative expression level. Using expression found in normal tissues as a mean 
expression score aids in validating whether the marker assayed is specific (versus normal 
cells). In addition, as more data is accumulated, the mean expression value can be revised, 
providing improved relative expression values based on accumulated data. 
[001 13] Detection of Polypeptides, In another embodiment of the present invention, a 
polypeptide corresponding to a marker is detected. A preferred agent for detecting a 
polypeptide of the invention is an antibody capable of binding to a polypeptide 
corresponding to a marker of the invention, preferably an antibody with a detectable label. 
Antibodies can be polyclonal, or more preferably, monoclonal. An intact antibody, or a 
fragment thereof, e.^.. Fab or F(3b% can be used. The term "labelled", witii regard to the 
probe or antibody, is intended to encompass direct-labelling of the probe or antibody by 
coupling, i.e., physically linking, a detectable substance to the probe or antibody, as well as 
indirect-labelling of the probe or antibody by reactivity with another reagent that is directly- 
labelled. Examples of indirect labelling include detection of a primary antibody using a 
fluorescently-labelled secondary antibody and end-labelling of a DNAprobe with biotin 
such that it can be detected with fluorescently-labelled streptavidin. 
[00 1 14] Proteins from individuals can be isolated using techniques that are well-known 
to those of skill in the art. The protein isolation methods employed can, e.g., be such as 
those described in Harlow & Lane (1988), supra. 

[001 15] A variety of formats can be employed to determine whether a sample contains a 
protein that binds to a given antibody. Examples of such formats include, but are not 
limited to, EIA; radioimmunoasay (RIA), Westem blot analysis and ELIS A A skilled 
artisan can readily adapt known protein/antibody detection methods for use in detemiining 
whether cells express a marker of the present invention and the relative concentration of 
that specific polypeptide expression product in blood or other body tissues. 
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[001 16] la one format, antibodies or antibody fragments, can be used in methods, such 
as Western blots or immunofluorescence techniques to detect the expressed proteins. In 
such uses, it is generally preferable to immobilize either the antibody or proteins on a solid 
support. Suitable soUd phase supports or carriers include any support capable of binding an 
antigen or an antibody. Well-known supports or carriers include g^ass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, 
polyacrylamides, gabbros and magnetite. 

[001 17] One skilled in the art will know many other suitable carriers for binding 
antibody or antigen, and will be able to adapt such support for use with the present 
invention. For example, protein isolated from patient cells can be run on a polyacrylamide 
gel electrophoresis and immobilized onto a solid phase support, such as nitrocellulose. The 
support can then be washed with suitable buffers followed by treatment with the detectably- 
labelled antibody. The sohd phase support can then be washed with the buffer a second 
time to remove unbound antibody. The amount of bound label on the solid support can then 
be detected by conventional means and this measurement translated into a level or 
concentration of protein in blood or anothCT body tissue. 

[001 18] The invention also encompasses kits for detecting the pres^ce of a polypeptide 
or nucleic acid corresponding to a marker of the invention in a biological sample, e.g., any 
body fluid including, but not limited to, serum, plasma, lymph, cystic fluid, urine, stool, csf, 
acitic fluid or blood and including biopsy samples of body tissue. For example, the kit can 
comprise a labelled compound or agent capable of detecting a polypq>tide or an mRNA 
encoding a polypq>tide corresponding to a marker of the invention in a biological sample 
and means for determining the amount of the polypeptide or mRNA in the sample, e.g,, an 
antibody which binds the polypeptide or an oUgonucleotide probe which binds to DNA or 
mRNA encoding the polypeptide. Kits can also include instructions for interpreting the 
results obtained using the kit. 

[001 19] For antibody-based kits, the kit can comprise, e.g.^ 1) a first antibody, e.g., 
attached to a solid support, which binds to a polypeptide corresponding to a marker or the 
invention; and, optionally; and optionally 2) a second, different antibody which binds to 
either the polypeptide or the first antibody and is conjugated to a detectable label. 
[00120] For oligonucleotide-based kits, the kit can comprise, e.g.^ 1) an oligonucleotide, 
e.g., a detectably-labeUed oligonucleotide, which hybridizes to a nucleic acid sequence 
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encoding a polypq)tide corresponding to a marker of the invention; or 2) a pair of primers 
useful for amplifying a nucleic acid molectile corresponding to a marker of the invention. 
[001 21] The kit can also comprise, e.g., sl buffering agent, a preservative or a protein- 
stabilizing agent. The kit can further comprise components necessary for detecting the 
detectable-label, e,g,, an enzyme or a substrate. The kit can also contain a control sample or 
a series of control samples, which can be assayed and compared to the test sample. Each 
component of the kit can be enclosed within an individual container and all of the various 
containers can be within a single package, along with instructions for interpreting the 
results of the assays performed using the kit. 

[00122] Kits. The kits of the invention may contain a written product on or in the kit 
. container. The written product describes how to use the reagents contained in the kit to 
determine whether a patient will experience hepatotoxicity during drug treatment. In 
several embodiments, the use of the reagents can be according to the methods of the 
invention. In one embodiment, the reagents are primer pairs for performing PCR analysis of 
ILIA genetic polymoiphisms. 

EXAMPLE 

CLINICAL PHARMACOGENETIC ANALYSIS OF HEPATOTOXICITY 
IN THE CLINICAL TRIAL 

[00123] Demographics of clinical pharmacogenetic analysis participants. Of the 139 
subjects enrolled in the clinical trial, 83 consented to participation in tiie clinical 
pharmacogenetic portion of the clinical trial. This represents about 60% of the total 
population that participated in the clinical trial. The clinical pharmacogenetic analysis 
population was representative of the clinical trial group in terms of age, race and gender. 
Furthermore, the consent rate was comparable for each arm of the trial (placebo, 50, 100 
and 150 mg/day), such that the clinical pharmacogenetic analysis was not biased toward 
one dosage group. No statistically significant differences were observed between the 
demographics of the clinical pharmacogenetic population compared to the overall trial 
population. 
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TABLE 1 

Distribution of clinical phaimacogenetic samples compared to 
flie overall clinical trial samples 





CPG samples 


Trial samples 


AGE (mean,years) 


58-8 


593 


RACE" 






Caucasian 


(70) 84% 


(117) 84% 


Black 


(5) 6% 


(9) 6.4% 


Oriental 


(1) 1.2% 


(2) 1.4% 


Hispanic 


(7) 8.4% 


(9) 7% 


Other 


(0) 0% 


i y|0/ 
(Z) 1.4% 


GENDER 






Male 


( 50 ) 60% 


( 88 ) 63% 


Female 


( 33 ) 40% 


/CI \ "J "70/ 


DOSEf 




0 mg/day 


(23) 28% 


(34) 24% 


50 mg/day 


(20) 24% 


(32) 23% 


100 mg/day 


(20) 24% 


(37) 27% 


150 mg/day 


(20)24% 


(36) 26% 


Hepatotoxicity^ 






Yes 


(6) 7% 


(9) 6% 


No 


(77) 93% 


(130) 94% 



^p<0.7748 (ANOVA) 
^p<0.91 12 (Fisher^s Exact) 
^<0.6698 (Fisher's Exact) 
^<0.9342 (Fisher's Exact) 
'*p<L000 (Fisher's Exact) 



[00124] Blood samples from each patient were collected at individual trial sites and 
shipped to Covance (Geneva, Switzerland), where genomic DNA was extracted using the 
PUREGENB™ DNA Isolation Kit (D-50K) (Gentra, Minneapolis, MN). 
[00125] Genotyping. A total of 18 loci in seven genes were genotyped. SNP assays were 
designed using information from public databases such as OMIM, the SNP Consortium, 
Locus link and dbSNP, and information from Third Wave Technologies, Inc. (TWT, 
Madison, WI). Any loci that were not polymorphic in this trial population were not 
analyzed ftirther. Genotyping was performed with 40-60 ng of genomic DNA using the 
Invader® assay developed by Third Wave Technologies according to the manufacturer's 
instructions, Lyamichev V et aL,Nat Biotechnol 17: 292-6 (1999); Ryan D et aL Mol 
Diagft 4: 135-44 9 (1999) 

[00126] Each SNP interrogated in this trial was assigned a clinical phaimacogenetic 
(CPG) identifier, referred to as the PG locus ID. See TABLE 2 for a list of all SNPs 
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asisayed in this trial, along with their PG locus IDs and details about the location 
polymorphism within the gene of interest 

TABLE 2 

List of genes examined in the clinical phatmaco genetic analysis 
Gene Name 



Ofthe 



Gene 
Symbol 
ABCBJ 



ABCBl 



ABCB2 



CD14 
CYP2D6 



CYP2D6 



CYP2D6 



CYP2D6 



ATF'bmding 
cassette, sub-family B 
(MDR/TAP), member 
1 

ATP'binding 
cassette, sub-family B 
(MDR/TAP), member 
1 

ATF'binding 
cassette, sub-family B 
QdDR/TAP), member 
1 

CD! 4 antigen 
cytochrome P450, 
subfamily IID 
(debrisoquine, 
sparteine, etc, 
-metabolizing), 
polypeptide 6 
cytochrome P450, 
subfamily UD 
(debrisoquine, 
sparteine, etc., 
-metabolizing), 
pofyp^tide 6 
cytochrome P450, 
subfamily JUD 
(debrisoquine, 
sparteine, etc., 
-metabolizing), 
polypeptide 6 
cytochrome P450, 
subfamily UD 
(debrisoquine, 
sparteine, etc,, 
-metabolizing), 
polyp^tide 6 



CPG 
Locus ID 
181 



1006 



1045 



1708 
31 



211 



212 



213 



REF ACC 
M29445 



U00699 
M33388 



M33388 



M33388 



M33388 



Allele 1 
0.47 



AC00S068J 0.63 



AC002457.1 0.48 



0.56 
0.86* 



0.60* 



0.52 



0.83 



Allege 
2 Freq. 
0.53 



0.37 



0.52 



0.44 
0.14* 



0.40* 



0.48 



0.17 



Position 
176 

83646 



312 
3465 



4469 



5799 



1719 



Location 

Exon 26 

ILE1144IL 

B 

Unknoivn 



19981202 Unknovm 



Promoter 
Intron 



Exon 6 

ARG269CY 

S 



Exon 9 

THR486SE 

R 



Exon 1 
PR034SER 
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TABLE 2 

List of ge nes mcflmin ed in the r-litncal pharmaco genetic analysis 





Grene Name 


CPG 


REF ACC 


Allele 1 


Allele 2 


Position 


Location 


bvmboi 




l^cus iiJ 




preq. 


Freq. 








cytOCnTOmB rnOU, 








O.lO* 


20335 


Intron 10 




subjhtnily IIIA 
















(niphedipine 
















oxidase), polypeptide 

4 














ILIA 


interleukin I, alpha 


279 


X03833 


0.72* 


0.28* 


549 


Promoter 


ILIA 


interleukin I, alpha 


302 


X03833 


0.72* 


0.28* " 


6282 


ExonS 


ILIA-^ 


interleukin I, alpha 


303 


X03833 • 


1.00 


0.00 


4282 


£xon4 


NRJI2 


nuclear receptor 


2o4l 




0.62* 


0.38* 


42813 


Promoter 




subfanufy J, group I, 
















member 2 














IwKJJJ 


nuclear r&^eptor 


2042 


ArUOll/DO 


O.OO 


A Afi 

0.40 


232 


Intron 




subfamily I, group I, 
















member 2 
















nuclear receptor 






U.ov 


U.2U 


111 <kA 


3 I/IK 




subfamily I, group I, 
















member 2 














ORMl 


orosomucoid J 


1831 


NT„031830 


0.57* 


0.43* 


191 


Exon 1 




(alpha- 1 -acid 
















glycoprotein) 














ORMl 


orosomucoid I 


1832 


NT_O31830 


0.97 


0.03 


2077 


ExonS 




(alpha-l-acid 
















glycoprotein) 














ORMl 


orosomucoid I 


1833 


NT_031830 


0.96 


0.04 


5041 


Intron 




(alpha-I-acid 
















glycoprotein) 















*This SNP is not in Hardy-Weinberg equilibrium in tins patient population. 
'^This locus is not polymorphic in ^se patient populations and was not used in the analysis. 



[00127] Lod in ABCBl, CD14, ILIA, and ORMl were interrogated by directly assaying 
genomic DNA using Third Wave Technologies technology. Because of high sequence 
homology within the CYP450 family of genes, CYP2D6 was polymerase chain reaction 
(PGR) amplified in three fragments prior to Invader® assay genotypmg to insure specificity. 
Primer sequences for each segment are listed in TABLE 3 along with the region of the gene 
spanned by each primer set. Each amplicon was generated in a 25 |li1 reaction containing 
20-60 ng of genomic DNA, 0.5 fxl of 10 mM dNTPs, 2.5 ^l of lOX PGR Buffer I with 15 
mM MgCl2 (Applied Biosystems, Foster City, GA), 2.5 ^il DMSO, 0.5 \i\ of 20 jiM 
GYP2D6-forward primer, 0.5 of 20 |iM GYP2D6-reverse primer, and 1.25 U Taq DNA 
polymerase (Applied Biosystems). 35 cycles of amplification were performed using the 
following conditions: 94^G, 30 sec; 65*'C, 1 min.; 72'*G, 2 min. Amplification of the 
appropriate product was confirmed on five random samples by fractionation on a 1% 
agarose gel containing ethidium bromide. Amplicons were diluted 1:10 in TE 8.0) 
before genotyping. 
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TABLE 3 
Primers used to amplify CYP2D6 



Exons 3, 4 and 5 
(31 &211) 



Goie region 
(PG locus ID^ 
Exons 1 and 2 
(31) 



Exons 7, 8 and 9 
(212) 



Primer Name Primer Seqiienr.R 
f5'^3'> 

2D6L1F1 CTGGGCTGGGAGCAGCCTC 

(SEQIDN0:5) 
2D6L1R1 CACTCGCTGGCCTGTTTCATGTC 

(SEQIDN0:6) 
2D6L2F CTGGAATCCGGTGTCGAAGTGG 

(SEQIDNO:?) 
2D6L2R2 CTCGGCCCCTGCACTGTTTC 

(SEQIDNO:8) 
2D6L3F GAGGCAAGAAGGAGTGTCAGGG 

(SEQIDN0:9) 
2D6L3R5B AGTCCTGTGGTGAGGTGACGAGG 

(SEQIDNO:10) 



Size 
(by) 
2036 



1683 



1754 



[001 28] PGR for the CYP3A4 gene was paformed in a 25 fxl reaction containing 1 5-30 
ng genomic DNA, 0.4 jil of 10 mM dNTPs, 2.5 ^1 of lOX PGR Bufferl with 15 mM MgCfc 
(Applied Biosystems), 0.75 ^il of 20 }iM CYP3A4-forward primer, 0.75 [il of 20 jxM 
CYP3 A4-reverse primer, and 0.75 U Taq DNA polymerase (Applied Biosystems). 30 
cycles of amplification wa-e performed using the following conditions: 94°C, 30 sec; 58°C, 
30 sec; 72''C, 30 sec. To confirm that the appropriate product was generated, five samples 
were firactionated on a 1% agarose gel containing ethidium bromide and firagment size was 
visualized. Primer sequences are as follows: 

CYP3A4ExonlOF-(5'-TGGATGGCCCACATTCTCG-3'; SEQ ID NO:ll), and 
CYP3A4ExonlOR-(5' CTTCCTACATAGAGTCAGTG-3'; SEQ ID NO: 12). A 1:20 
dilution of the PGR product in TE (pH 8.0) was run against PG locus ID 2325 using a 384 
well biplex plate for ampUfied DNA. 

[00129] Restriction Fragm^t L^gth Polymorphism (KFLP) analysis was used to 
genotype tiiree polymorphic loci in the NR1I2. NR1I2 sequences were first PGR amplified 
in 25 nl reactions containing 15-30 ng genomic DNA, 0.4 ^1 of 10 mM dNTPs, 2.5 jil of 
lOX PGR Buffer I with 15 mM MgCb (Applied Biosystems), 0.50' fil of 20 \jM. 
NRlI2-forward primer, 0.50 nl of 20 |jM NRlI2-reverse primer, and 0.75 U Taq DNA 
polymerase (Applied Biosyst^ns). Sequences for the primer sets used for each assay are 
listed in TABLE 4. Thirty-five cycles of amplification were performed using the following 
conditions: 94°C, 30 sec; 60"C, 30 sec; 72°C, 30 sec. Amplicons ware firactionated on a 3% 
agarose gel containing ethidium bromide. 
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PG locus ID 
(Assay Name) 

2642 

(PXR-252) 



2643 

(PXR-ni56) 



2641 

(PXR-2411S) 



TABLE 4 

RFLP analysis of NR1I2 polymorphisms 



Primer TsJatTtft 

PXR-252-F 

PXR-252-R 

PXR-11156-F 

PXR-11156-R 

PXR-24113-F 

PXR-24113-R 



PrimCT Sequence 
(5 '-3'^ 



Restriction 
Endonuclease 



GGACACAGAGTCT BSMBl 
GTTCCTGG 

(SEQIDNO:13) 
GAAGATGAAGGA 
TTCCTCTGGG 

(SEQIDN0:14) 
GACAAGGCTACGC Ddel 
TGACAATCAG 

(SEQIDNO:15) 
GCTTGCGTATGTT 
TCTATTTCCAC 

(SEQE)NO:16) 
CGGAGCAAAGAA HphI 
CTTACCACC 

(SEQIDNO:17) 
TGCAGGACCAGAG 
AGCATCAG 

(SEQIDNO:18) 



Frag Size 
(bp) / allele 
caU 
318/A 

204 & 114/G 



342/C 

193 & 149/A 



253/G 
195 & 58/A 



[00130] Restriction enzymes used for RFLP analysis of the PGR products are listed in 
TABI£ 4 along with the fiiagment size they produced and the resultant allele call. All 
restriction enzymes were purchased fiom New England Biolabs, Beverly, MA. Reaction 
conditions were as follows: (1) BMSBl digests were performed in a 20 ^1 reaction using 2 
]il of lOX Buffer 3 (Nfew England Biolabs), 8 p.1 anq)lified DNA, and 2U BSMBl enzyme. 
Reaction mixtures were incubated for 4.5 hours at 55''C; (2) Ddel digests were performed in 
a 20 (il reaction using 2 jil of lOX Buffer 3 (New England Biolabs), 8 pii amplified DNA, 
and 4 U of Ddel enzyme. Reactions were incubated for 17 hours at 37°C; (3) HphI digests 
were as Ddel digests except lOX Buffer 4 (New England Biolabs) was used instead of lOX 
Buffer 3. Digested DNA was fractionated (10 ^1) on a 3% agarose gel containing ethidium 
bromide and band size was visualized. 

[00131] Statistical Analysis. Analysis of variance (ANOVA) and Fisher's Exact tests 
were used for the analysis of the effect of genome and hepatotoxicity. All statistical 
analyses were performed using the SAS 8.02 software. To correct for multiple testing, tiie 
Bonferroni correction method was performed (see, below). 
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[001 32] Hepatotoxicity and N-benzoyUstaurosporine metabolism. The relationship 
between the occurrence of hepatotoxicity in the clinical trial and the dose of 
N-benzoyl-staurosporine administered was examined. The percentage of subjects per dose 
group that experienced hepatotoxicity are as follows: 3% for 50 mg/day, 8% for 1 00 
mg/day and 14% of those that took 150 mg/day. While most of the subjects that 
experienced liver toxicity took the highest dose of N-benzoyl-staurosporine, the association 
between of N-benzoyl-staurosporine dose and hepatotoxicity is not significant (p == 0.09; 
Fisher's Exact). 

[00133] Phaimacoldnetic assessments were done in the clinical trial, but the two major 
metaboUtes of N-benzoyl-staurosporine could not be detected in the blood of most subjects 
because they are strongly boimd to the serum protein ai-acid glycoprotein (AGP). AGP is a 
heavily glycosylated protein that is synthesized primarily in the liver and functions as an 
acute phase response protein. Hochepied T et aL, Cytokine Growth Factor Rev 14: 25-34 
(2003); Israili ZH & Dayton PG, Drug Met Rev 33 : 161-235 (2001). An analysis was done 
to see if AGP levels correlated with tiie occmrence of hq)atotoxicity. The maximum AGP 
level from visit 3-5 was used for this analysis, and all participants in the clinical trial, not 
just those that consented to the pharmacogenetics anal3^is, were included. The mean 
maximum concentration of AGP among those that experienced hepatotoxicity was 109.7 
mg/dl, which was significantly higher than 87.8 mg/dl, the mean maximum concentration 
for those that did not experience hepatotoxicity (p=0.046, ANOVA). 
[00134] AGP is encoded by two genes, ORMl and ORM2, which are closely linked on 
chromosome 9q31-34.1. Webb GC et aL, Cytogenet Cell Genet 47: 18-21 (1998). ORMJ is 
highly polymorphic (Yuasa I et aL, Hum Genet 99: 393-8 (1997)), and three SNPs m 
ORMl were interrogated in this trial (PG locus IDs 1831, 1832 and 1833). No association 
was seen between genotypes at the three ORMl loci examined and levels of AGP detected 
in the serum. See, TABLE 5. 

TABLES 

ORMl SNPs do not associate with maximimi AGP levels. 

PG locus ID AGP Max (p value, ANOVA^ 

1831 0.51 

1832 0.07 

1833 0.95 



37 



wo 2005/040415 



PCT/EP2004/011123 



[00135] Furfhennore, the ORMl SNPs interrogated in this analysis did not associate 
with hepatotoxicity. See, TABLE 6, below. To examine possible associations between the 
occurrence of hepatotoxicity and SNPs in CYP2D6, CYP3A4, ABCBl and NR1I2 more 
thoroughly, each liv^ enzyme was examined independently. The maximum serum alanine 
aminotransferase and aspartate transaminase values recorded on visits 3-S were analyzed 
using ANOVAs- No associations were found between SNPs in genes that contribute to the 
metabolism and distribution of N-benzoyl-staurosporine and the maximum elevation of 
either alanine aminotransferase or aspartate transaminase. No significant association was 
foimd between maximum alanine aminotransferase or aspartate transaminase levels and the 
* Q/?M7 polymorphisms. 

TABLE 6 



Association between hepatotoxicity^ ALT Max and AST Max and 17 SNPs located in 6 







different genes 






Gene 


PG Locus ID 


Overall 


ALT Max 


AST Max 






Hepatoxicitv 


(n value: 


fp value: 






call fp value: 


ANOVA> 


ANOVA> 






Fisher's Exacts 






ABCBl 


181 


0.59 


0.55 


0.32 


ABCBl 


1006 


1.00 


0.63 


0.24 


ABCBl 


1045 


0.50 


0.35 


0.09 


CD14 


1708 


1.00 


0.87 


0.90 


CYP2D6 


31 


1.00 


0.99 


0.63 


CYP2D6 


211 


1.00 


0.85 


0.66 


CYP2D6 


212 


0.46 


0.16 


0.52 


CYP2D6 


213 


0.22 


0.17 


0.59 


CYP3A4 


2325 


0.72 


0.38 


0.49 


ILIA 


279 


0.23 


0.37 


0.031 


ILIA 


302 


0.29 


0.38 


0.029 


NR1I2 


2641 


0.82 


0.56 


0.21 


NR1I2 


2642 


0.84 


0.28 


0.86 


NR1I2 


2643 


1.00 


0.57 


0.73 


ORMl 


1831 


0.61 


0.72 


0.76 


ORMl 


1832 


0.24 


0.80 


0.82 


ORMl 


1833 


1.00 


0.96 


0.64 



[001 36] In summary, the data in TABLES 5 and 6 did not provide evidence that the 
hepatotoxicity observed in the clinical trial was the result of varied exposure to 
N-benzoyl-staurosporine. 

[00137] However, N-benzoyl-staurosporine is metabolized by CYP3A4 and CYP2D6^ 
and is a substrate of the p-glycoprotein pump, encoded hy ABCBL CYP3A4 is not very 
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polymorphic, and genetic variants rarely account for differences seen in the fimcticn of the 
protein. Spnrdle AB et ah. Pharmacogenetics 12: 355-66 (July 2002) and internal analysis. 
Also, transcription of CYP3A4 is regulated by the pregnane X receptor, PXR (Goodwin B 
et aL, Annu Rev Pharmacol Toxicol 42: 1-23 (2002)), which is encoded by a polymorphic 
gene called JVRi/2. Accordingly, polymorphisms in the following genes that contribute to 
N-benzoyl-staurosporine metaboUsm were examined: CYP2D6 (PG locus IDs 31, 21 1, 212, 
213); CYP3A4 (PG locus ID 2325); NR1I2 (PG locus IDs 2641, 2642, 2643); and ABCBl 
(PG locus IDs 181, 1006, 1045). The occurrence of hepatotoxicity in the clinical trial was 
not found to be associated with any of the polymorphisms interrogated in these four genes. 
[00138] Hepatotoxicity and genes associated with idiosyncratic mechanisms of liver 
injury, Hepatotoxicity observed during the clinical stages of drug development is oftoti 
associated with levels of exposure to the drag. However, some subjects may experience 
liver enzyme elevations because something about their intrahepatic milieu (levels of 
cytokines or razymes tiiat neutralize £ree radicals, for example) promotes toxicity of the 
compound. These mechanisms of liver toxicity are referred to as "idiosyncratic". Finding 
an association between genetic polymorphisms and hepatotoxicity arising from 
idiosyncratic mechanisms is not very likely because hepatotoxicity is a relatively rare event 
that can be influenced by multiple signaling pathways. 
[00139] Hepatotoxicity and genes associated with inflammatory response. 
Polymorphisms in two genes tiiat could contribute to acute phase responses in the liv^, 
CD 14 and ILIA, were examined in this trial. CD 14 regulates release of inflammatory 
cytokines from Kuppfer cells (Jarvelainen HA et aL, Hepatology 33: 1148-53 (2001)) and 
ILIA is an important mediator of responses to tissue damaging agents. Ramadori G & 
Christ B, Semin Liver Dis 19: 141-55 (1999), Two polymorphic loci in ILIA were 
interrogated in this trial (PG locus IDs 279 and 302) and three loci were investigated in 
CD14 (PG locus IDs 2641, 2642 and 2643). No associations were found between the loci 
we studied and the overall hepatotoxicity call in the clinical trial. See, TABLE 5. 
[00140] Next, SNPs in ILIA and CD14 were analyzed for their possible association with 
elevations in each liver enzyme independently. The maximum serum aspartate transaminase 
or alanine aminotransferase value recorded between visits 3 and 5 was associated with 
genotypes at the loci of interest in ILIA and CD14 using an ANOVA. No associations were 
foimd between alanine aminotransferase and SNPs in ILIA or CD 14. 



39 



wo 2005/040415 



PCT/EP2004/011123 



[00141] However, both ILIA SNPs, PG locus IDs 279 and 302, were associated with the 
maximum serum aspartate transaminase value recorded on visits 3, 4, or 5 (p = 0.031 and 
0.029, respectively). PG locus ID 279, a C->T transition in the promoter of ILIA (position 
549 in GenBank accession number X03833), has been reported to be in linkage 
disequilibrium with PG locus ID 302. Jouvenne P et al, Eur Cytokine Netw 10: 33-6 
(1999). Our findings support strong linkage between these two loci (99.99%). PG locus ID 
302 is a G->T base change in exon 5 (position 6282 in GenBank accession number 
X03833) that results in an amino acid substitution of alanine to serine. For each of these 
SNPs, the TT genotype is rare; only two individuals are TT for PG locus ID 279 and 302. 
For this reason, we analyzed TT homozygous individuals together with the GT (PG locus 
ID 302) or CT (PG locus ID 279) heterozygotes. Thus, for PG locus ID 279, subjects with a 
CC genotype were compared with subjects with a T at that locus (eiflier CT or TT). For PG 
locus ID 279, the average maximum serum aspartate transaminase value for CC individuals 
was 37.1 U/L, while T (CT and TT) individuals had a significantly lower average 
maximum serum aspartate transaminase value of 23.1 U/L ^=0.0089, ANOVA). Likewise, 
for PG locus ID 302, subjects who were GG were compared to Ts (subjects who were either 
GT or TT). When the data were grouped using this approach, a much stronger association 
was seen between genotype at each locus and the occurrence of hepatotoxicity. For PG 
locus ID 302, GG individuals had an average maximum serum aspartate transaminase level 
of 36.6 U/L, which was significantly higher than 22.9 U/L, the average maximmn serum 
aspartate transaminase level for T (CT and TT) individuals (p= 0.0097, ANOVA). The 
scatt^lots in FIG. 1 and FIG. 2 show that the subjects with the highest maximum serum 
aspartate transaminase values while taking N-benzoyl-staurosporine were CC at PG locus 
ID 279 and GG at PG locus ID 302. 

[00142] Because multiple SNPs were tested for tihieir association witti hepatotoxicity in 
the clinical trial, a correction factor was applied to the results. The Bonferroni correction 
method requires that p values be adjusted by a factor of 17 (the number of polymorphic 

SNPs interrogated in this trial). Bonferroni = = = 0.0029, where 

TJ 17 

7/ = PCKA\2 _number _of _tests . Using this correction factor, no SNP with an 
association with p ^ 0.0029 is significant. Associations between ILIA polymorphisms and 
serum aspartate transannnase levels had p values of 0.0089 and 0.0097. 
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[00143] In summary, subjects who are CC at PG locus 279 and GG at PG locus 302 
within the ILIA gene are more likely to experience elevated serum aspartate transaminase 
levels after taking N-benzoyl-staurosporine. While the average aspartate transaminase 
maximum for CC subjects was not over the upper limit of normal, a scatterplot of the data 
demonstrates that all but one of the subjects that had serum aspartate transaminase levels 
over the upper limit of noraial were CC for PG locus ID 279 (FIG. 1) and GG for PG locus 
ID 302 (FIG. 2). 

[00144] Drag-induced liver toxicity is often characterized by inflanmiation. Jaeschke H 
et al, Toxicol Sci 65: 166-76 (2002). The interleukin 1 family of proteins is known to have 
multiple biological activities and are key regulators of the response to infection and injury. 
Like other proinflammatory cytokines, ILIA induces NF-kB activity, thereby increasing 
the transcription of cytokine-inducible genes. The effects of ILIA are mediated by the 
induction of other c3^okines, including granulocyte colony-stimulating factor, tumour 
necrosis factor alpha (TNF), interleukin 6, interleukin 8, and platelet-derived growth factor. 
Paul WE, Fundamental Immunology, Fourth Edition (Lippingqott-Raven Publishers, 
Philadelphia, PA, 1999). Notably in mouse models, ILIA has been shown to act 
synergistically with TNF to induce liver injury. Nagakawa J et al, Immunopharmacol 
Immunotoxicol 13: 485-98 (1991). 

[00145] In addition to being associated with hepatotoxicity, the interleukin 1 family of 
proteins have been implicated in the induction of insulin secretion and stimulation of 
apoptosis in pancreatic P cells. Paul WE, Fundamental Immunology, Fourth Edition 
(Lippingcott-Raven Publishers, Philadelphia, PA, 1999). This is relevant for this trial since 
the clinical trial participants had either type I or type n diabetes. The production of BLl by 
local inflanmiatory cells during the autoinmaune process in insulin-dependent diabetes 
mellitus (type I diabetes) may contribute to the destruction of pancreatic p cells. 
Mandrup-Poulsen T a/.. Cytokine 5: 185-81 (1993). Interleukin 1 family members have 
been shown to induce the production of nitric oxide (NO) by pancreatic islets, and reports 
in the literature support a role for NO in diabetes development. Furthermore, treatment of 
the pancreatic p-celHike line RIN-5AH with ILIA resulted in apoptosis and necrosis 
within 4 hours after treatment. Vassiliadis S et al. Mediators Inflamm 8: 85-91 (1999). 
Interestingly, ILIA was shown to induce the expression of PKC in RIN-5AH cells by 30%. 
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Therefore, in addition to influencing hepatotoxicity, ILIA may influence the efficacy of 
N-benzoyl-staurosporine in diabetic patients. 

[00146] All references cited herein are incorporated herein by reference in their entirety 
and for all purposes to the same extent as if each individual publication or patent or patent 
application was specifically and individually indicated to be incorporated by reference in its 
entirety for all purposes. In addition, all GenBank accession numb^, Unigene Cluster 
numbers and protein accession numbers cited herein are incorporated herein by reference in 
their entirety and for all purposes to the same extent as if each such number was 
specifically and individually indicated to be incorporated by reference in its entirety for all 
purposes. 

[001 47] The present invention is not to be limited in terms of the particular 
embodiments described in this application, which are intended as single illustrations of 
individual aspects of the invention. Many modifications and variations of this invention can 
be made without departing fix)m its spirit and scope, as will be apparent to those skilled in 
the art. Functionally equivalrat methods and sqpparatus within the scope of the invention, in 
addition to tiiose enimierated herein, will be s^parent to those skilled in the art &om the 
foregoing description and accompanying drawings. Such modifications and variations are 
intended to fall within the scope of the upended claims. The present invention is to be 
limited only by the terms of the appended claims, along witii the full scope of equivalents to 
which such claims are entitled. 
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